The Role of Semantics in Automatic Summarization: A Feasibility Study
نویسندگان
چکیده
State-of-the-art methods in automatic summarization rely almost exclusively on extracting salient sentences from input texts. Such extractive methods succeed in producing summaries which capture salient information but fail to produce fluent and coherent summaries. Recent progress in robust semantic analysis makes the application of semantic techniques to summarization relevant. We review in this paper areas in the field of summarization that can benefit from the introduction of the type of semantic analysis that has become available. The main pain points that semantic information can alleviate are: aiming for more fluent summaries by exploiting logical form representation of the source text; identifying salient information and avoiding redundancy by relying on textual entailment and paraphrase identification; and generating a coherent summary while relying on rhetorical structure and discourse structure information extracted from the source documents. In addition, we review the possibility to perform automatic Pyramid evaluation of summarization quality that relies on robust semantic similarity measures.
منابع مشابه
A survey on Automatic Text Summarization
Text summarization endeavors to produce a summary version of a text, while maintaining the original ideas. The textual content on the web, in particular, is growing at an exponential rate. The ability to decipher through such massive amount of data, in order to extract the useful information, is a major undertaking and requires an automatic mechanism to aid with the extant repository of informa...
متن کاملSystematic literature review of fuzzy logic based text summarization
Information Overloadrq is not a new term but with the massive development in technology which enables anytime, anywhere, easy and unlimited access; participation & publishing of information has consequently escalated its impact. Assisting userslq informational searches with reduced reading surfing time by extracting and evaluating accurate, authentic & relevant information are the primary c...
متن کاملBiogeography-Based Optimization Algorithm for Automatic Extractive Text Summarization
Given the increasing number of documents, sites, online sources, and the users’ desire to quickly access information, automatic textual summarization has caught the attention of many researchers in this field. Researchers have presented different methods for text summarization as well as a useful summary of those texts including relevant document sentences. This study select...
متن کاملAutomatic Summarization of Meeting Data: A Feasibility Study
The disclosure of audio-visual meeting recordings is a new challenging domain studied by several large scale research projects in Europe and the US. Automatic meeting summarization is one of the functionalities studied. In this paper we report the results of a feasibility study on a subtask, namely the summarization of meeting transcripts. A Maximum Entropy based extractive summarization system...
متن کاملبرچسبزنی خودکار نقشهای معنایی در جملات فارسی به کمک درختهای وابستگی
Automatic identification of words with semantic roles (such as Agent, Patient, Source, etc.) in sentences and attaching correct semantic roles to them, may lead to improvement in many natural language processing tasks including information extraction, question answering, text summarization and machine translation. Semantic role labeling systems usually take advantage of syntactic parsing and th...
متن کامل